Corpus Language
   HOME

TheInfoList



OR:

A corpus language is a language that has no living speakers, though a number of the actual productions of the native speakers have been preserved in some way (usually in written records).Langslow, D.R. 2002 "Approaching bilingualism in corpus languages" in James Noel Adams, Mark Janse, Simon Swain (edd.) ''Bilingualism in Ancient Society: Language Contact and the Written Text'' Oxford: OUP Examples of corpus languages are
Ancient Greek Ancient Greek includes the forms of the Greek language used in ancient Greece and the ancient world from around 1500 BC to 300 BC. It is often roughly divided into the following periods: Mycenaean Greek (), Dark Ages (), the Archaic peri ...
,
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, the
Egyptian Language The Egyptian language or Ancient Egyptian ( ) is a dead language, dead Afroasiatic languages, Afro-Asiatic language that was spoken in ancient Egypt. It is known today from a large Text corpus, corpus of surviving texts which were made acces ...
,
Old English Old English (, ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the early Middle Ages. It was brought to Great Britain by Anglo-Saxon settlement of Britain, Anglo ...
and
Elamite Elamite, also known as Hatamtite and formerly as Susian, is an extinct language that was spoken by the ancient Elamites. It was used in what is now southwestern Iran from 2600 BC to 330 BC. Elamite works disappear from the archeological record ...
. Some corpus languages left a very large corpus, like
Ancient Greek Ancient Greek includes the forms of the Greek language used in ancient Greece and the ancient world from around 1500 BC to 300 BC. It is often roughly divided into the following periods: Mycenaean Greek (), Dark Ages (), the Archaic peri ...
and
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, and therefore can be totally reconstructed, even though some details of the pronunciation may be unclear. Such languages can be used even today, as is the case with
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
and Latin. Others have such a limited corpus that some important words, e.g. some pronouns, are not found in the corpus. Examples for this are
Ugaritic Ugaritic () is an extinct Northwest Semitic language, classified by some as a dialect of the Amorite language and so the only known Amorite dialect preserved in writing. It is known through the Ugaritic texts discovered by French archaeologis ...
and
Gothic Gothic or Gothics may refer to: People and languages *Goths or Gothic people, the ethnonym of a group of East Germanic tribes **Gothic language, an extinct East Germanic language spoken by the Goths **Crimean Gothic, the Gothic language spoken b ...
. Languages that are only attested by a few words, often names, and a few phrases (called ''Trümmersprachen'' in German linguistics, literally "rubble languages") can only be reconstructed in a very limited way and often their genetic relationship to other languages remains unclear. Examples are the
Lombardic language Lombardic or Langobardic is an extinct West Germanic language that was spoken by the Lombards (), the Germanic people who settled in Italy in the sixth century. It was already declining by the seventh century because the invaders quickly adopted ...
and
Dadanitic Dadanitic is the script and possibly the language of the oasis of Dadān (modern Al-'Ula) and the kingdom of Liḥyān in northwestern Arabia, spoken probably some time during the second half of the first millennium BCE. Nomenclature Dadanitic ...
, a
Semitic language The Semitic languages are a branch of the Afroasiatic language family. They are spoken by more than 330 million people across much of West Asia, the Horn of Africa, and latterly North Africa, Malta, West Africa, Chad, and in large immigrant a ...
that may be close to
classical Arabic Classical Arabic ( ar, links=no, ٱلْعَرَبِيَّةُ ٱلْفُصْحَىٰ, al-ʿarabīyah al-fuṣḥā) or Quranic Arabic is the standardized literary form of Arabic used from the 7th century and throughout the Middle Ages, most notab ...
. Corpus languages are studied using the methods of
corpus linguistics Corpus linguistics is the study of language, study of a language as that language is expressed in its text corpus (plural ''corpora''), its body of "real world" text. Corpus linguistics proposes that a reliable analysis of a language is more feas ...
, but corpus linguistics can be used (and is commonly used) for the study of the recorded productions of living languages. Not all
extinct language An extinct language is a language that no longer has any speakers, especially if the language has no living descendants. In contrast, a dead language is one that is no longer the native language of any community, even if it is still in use, li ...
s are "corpus languages," since many languages have disappeared leaving no, or very inadequate, recorded production of their speakers.


References


See also

{{Portal, Languages *
Endangered language An endangered language or moribund language is a language that is at risk of disappearing as its speakers die out or shift to speaking other languages. Language loss occurs when the language has no more native speakers and becomes a "dead langu ...
*
Language death In linguistics, language death occurs when a language loses its last native speaker. By extension, language extinction is when the language is no longer known, including by second-language speakers. Other similar terms include linguicide, the deat ...
Linguistics Historical linguistics Corpus linguistics Extinct languages